SSO (Search So) is a specialized search engine, meticulously crafted to cater to the unique requirements of mid-sized organizations such as DKU. Our strength lies in delivering rapid, top-notch search results. The entire multifaceted search engine can be deployed with a single instruction and requires minimal maintenance.
SSO is a fully automated, comprehensive search solution. Our mission is to convey information to the relevant audience with minimal strain on the IT department, while offering superior customizability compared to other search engines like Google.
SSO provides:
You may wonder why there's a need for another self-hosted search engine when large publicly accessible search engines like Google exist, and even LLM can be a suitable choice for information retrieval. However, our focus is on delivering search solutions for mid-sized organizations, such as companies, schools, and non-profit organizations. These entities often manage websites and documents that can be challenging to handle without a search engine, and the information that needs to be retrieved may be internal, beyond Google's reach. This is where SSO comes in.
Still questioning SSO's effectiveness? Just try searching for anything you want in DKU's search bar, and you'll grasp why an internal search engine is indispensable.
DKU Search Bar | SSO |
---|---|
If you believe DKU is the only organization struggling with internal information retrieval, you're mistaken!
While students may not always need sensitive internal information, and universities' public data can often be found on Google, company employees face a different scenario. They often find little of the information they need on Google due to companies' privacy policies.
Based on our research, most workers in companies organize their data using platforms like WeChat or Lark. These tools, not designed for information retrieval, often provide only basic text search or title-based document search. While central databases can perform high-quality retrieval, they cannot replace a search engine. It's common for employees to request document sharing simply because they can't find what they need. People are struggling with internal information retrieval! What's more, there's no widely accepted solution to this problem, leading to wasted working hours worldwide in internal searches.
Our target customers encompass a diverse range of organizations across various fields, as long as they face challenges in internal information retrieval. This includes not only mid-sized companies, schools, and non-profit organizations but also individually hosted forums, websites, and online communities. What's more, while some larger corporations may have their own internal search solutions (such as Microsoft), others do not. SSO can step in to fill this gap, offering a tailored search solution that meets their specific needs.
The SSO is powered by three primary components - the Scraper, Data Processor, and Retrieval Engine.
For more technical understanding of sso's mechanism, please check The SSO Doc (QR Code provided).
Elasticsearch: Elasticsearch is the most widely used self-hosted search engine solution, known for its high search quality and rapid speed, capable of handling super-large datasets. However, its complexity and high hardware demands mean that it requires significant IT resources, both in terms of hardware and engineering hours. While offering better scalability, Elasticsearch may be overkill for many organizations. Compared to Elasticsearch, SSO may provide fewer advanced features and not perform as well with enormous datasets, but it is far more user-friendly. What's more, the need for complex features and search capabilities over vast databases is not aligned with the needs of our target customers.
MeiliSearch: MeiliSearch is a more lightweight and user-friendly option compared to Elasticsearch, making it closer to SSO in its approach. It excels in search speed and is particularly focused on searching over local files with properly arranged indexing (for example, searching through movies with associated introductions and ratings). In contrast, SSO emphasizes document and website search, automatically generating all tags and quotations. This makes it more suitable for organizational use, where most documents are stored in pure text with various styles. SSO is specifically designed to search through such content, aligning perfectly with the needs of our target audience.
DKU-Wiki | SSO | SSO Doc |
---|---|---|